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1. Introduction 

In this report* we discuss relational programming . i.e. a style of programming in 
which entire relations are manipulated rather than individual data. This is analogous 
to functional programming [Backus78], wherein entire functions are the values mani- 
pulated by the operators. We wlU see that relational programming subsumes functional 
programming because every function is also a relation. It is appropriate at this point 
to discuss why we have chosen to investigate relational programming®. 

As we have noted, relational programming subsumes functional programming: 
hence, anything that can be done with functional programming can be done v.lth rela- 
tional programming. Furthermore, relational programming has many of the advan- 
tages of functional programming: for instance, the ability to derive and manipulate 
programs by algebraic manipulation. A well developed algebra of relations dates back 
to Boole's original work and has been extensively studied since then. Although rela- 
tions are more general than functions, their laws are often simpler. For insteince, 
if g)~^ = S’"*./”* is true for all relations, but true only for functions that are one-to- 
one. Also, relational programming more directly supports non-linear data structures, 
such as trees and graphs, than does functional programming. In relational program- 
ming the basic data values are themselves relations, whereas in functional program- 
ming there is a separate class of objects (lists) used for data structures. On.8 final re \- 
son for investigating relational programming is that it provides a possible paradigm f-. 
utilizing associative and active memories. As a teaser for what is to come, ve presc . 
the folio-wing example of a relational program. We v/ill take a text T . represented as 
array of words (i.e., T{i) is the i-th word), and generate a frequency table F so la ' 
F{w) is the number of occurences of word w in T. Now we will see (§4) that all T {m) il 
the set of all indices of the word w. If we let size(C) be the cardinality of a set C, then 

1. The work reported herein was supported by the Office of Naval Research Tinder contract nTimber NOOOl 4- 
62-VrR-20162, and by the Foundation Research Program of the Naval Postgraduate School vrlth funds 
provided by the Chief of Naval Research. 

2. The reader can find a shorter introduction to relational programming in [HacLennan83]. That report is a 
revision and extension of [MacLennanBla] and [MacLennan61b]. 
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the number of indices (occurences) of w is just size[all T Therefore we can write 

F = size. (all T) (§7). 

2. Classes and Relations 

2. 1 basic concepts 

Our relational calculus will deal with three sorts of things: individuals, classes and 
relations. These can best be illustrated by example. If 'x' is the name of an individual 
and 'C is the name of a class (set), then ‘xeC means that the individual denoted by 
‘x’ is a member of the class denoted by ' C (i.e., that x has property C). Thus 
'Aristotle eman' would indicate that Aristotle is a man, euid 'Seeven' would mean that 2 
is an even number. Some authors (e.g., Russell and Whitehead) use 'xtP' for 'x^P'. 
The symbol and its alternate 'a' are abbreviations for 'tan' , which is the Greek v/ord 
for 'is'. 

If 'x' and 'y' are names of individuals and ' R’ is the name of a relation, then 'x R y' 
meeins that x bears the relation R \.oy . For example, 

Aristotle student Plato 

means that Aristotle is a student of Plato. Also, '2 < 3' means that 2 bears the less- 
than relation to 3. A relation is just a set of pciirs. Therefore, if we use x\y to denote 
the basic pair-making operation, VnenxRy if and only if x:y ^ R. The notation that we 
have introduced above will be extended to classes of classes, classes of relations, rela- 
tions among classes, relations among relations, etc. 

2.2 relational descriptions 

There are several ways to describe classes and relations. One of the easiest is to list 
its elements, for example: 

S = n. 3, 5, 7 j 
R - ^ l:a, 2:b, 3:c, 4:d \ 
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This is called an extensional description of the class. Obviously, this is only possible if 
the class or relation is finite and only practical if it’s small. Therefore v/e also have 
intensional descriptions of classes and relations. 

If S{x) is a sentence involving 'x\ then a class description is an expression of the 
form '\x |5'(x)j'. This denotes the class of all individuals, a, for which S{a) is true, i.e., 

ae^x|5'(x)j 5'(a) 

Similarly, if S{x,y) is a sentence involving 'x' and then '\x\y\S{x,y)\' is o. relation 
description which describes the relation that holds between a and 6 whenever S{a,b) 
is true, i.e., 

a\x\y\S{x,y)\b S{a,b) 

To illustrate this notation we will define the converse of a relation. 

2.3 converse 

The relation is called the converse of R , i.e. xR~^y yRx. Using our notation 
for descriptions we can define R~^ - \x:y \ yRx\. As an example of a relation among 
relations, we define ‘inv’ as the relation that holds between converses: 
s invr r-s~^. Hence, inv = | Some examples of converses are 

parent'’^ = child and 

EXERCISES: Prove the following properties of the converse: 

^ 

r inv s <=> s inv r 
inv“^ = inv 

2.4 arrow diagrams 

Relations can be portrayed by G7ro^D diagrams (Haase diagran^s). In such a 
diagram there is a node for each individual related by the relation and an arrov/ from rj 
to y whenever xRy . For instance. 



- 3 - 



R = 



KELATIONAL PROGRAMMING 

C 



a *<7 





represents the relation R such that bRa, cRb, dRb , ^Rd, sRe , bRe and -xRy for all 
other cases: 

R = { b:a, c:b, d:b, e:d, e:e, b:e \ 

The effect of the converse operator is to reverse all of the arrows. Hence, R~^ is 
diagrammed: 

= a 




2.5 tables 

Relations can often be viewed as tables. For instance, the relation R of the previous 
section can be shown as the table in Figure 1. 







figure 1. Relation \^ewed as a Table 

Of course, it makes no difference in what order we write the rov/s of the table. 

The converse of a relation is obtained by simply exchanging the columns of the table 
(see Figure 2). Of course, classes are represented by one column tables. For instance 
the class C of primes less than ten is shown in Figure 3. 



3. Domains 

We often need to talk of the individuals that can occur on the right or left of a rela- 
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Figure 2. Converse of a Relation 

tion. We say that x is a left-member of R whenever there is a such that xRy . 

X Lm R <=> By{xRy) 

For instance, if ‘x parent^’ means that x is a parent of y, then ‘Socrates Lm parent’ 
means that Socrates is a parent. Riyht^embeT and member are defined analogously: 

y Rm R 3x {xRy ) 
z Mm R <r> z Lm R \/ z Rm R 

EXERCISES: Prove that these satisfy the identities: 

xLm/? xRmL?"* 
yRm/? «» yLm/?~' 



A. Functions 
4. 1 basic concepts 

Functions and relations are closely related. Consider the successor relation, 'succ': 
X succ y x-i-l = y . Thus, x succ y says that x’s successor is y . The corresponding 
arrow diagram is: 

1 2 3 4 5 ... 

< — ► » >«> * > • 



and the corresponding table is shown in Figure 4. since 1 succ 2, 2 succ 3, etc. Notice 
that, in this case, for each left member x there is a unique right member y such that 
X succy. This y can be written using Whitehead and Russell’s [WhiteheadTOj definite 
description: 
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_2 

_3 

_5 

Figure 3. Set Viewed as a Table 
iy{x succ y) 

This can be read; the y such that x's successor is y. A more convenient way to write 
this is succ(x). In general, R{x) means ‘the unique y such that x R y\ i.e. 
R{x) = iy{xRy). When no confusion will result we ^m^e Rx instead of R{x). This nota- 
tion is left-associative, that is, Fxy = {Fx)y. When we need to make the application 
operation explicit we write R@x {R at x or /? applied to x) for Rx . 

The functional notation is meaningful only if there is a unique y such that xRy, i.e. 
xRy /\xRz 'Dy=z. That is, there is only one arrow leading from x. When this condi- 
tion is satisfied for all x we call R right univalent, symbolized by ‘run’: 

7?erun \/xyz[ xRy /\ xRz Dy = z ] 

The right univalent relations are more commonly called functions. In a left univalent 
relation there is exactly one arrow leading to each node. Consider the ‘absolute 
reciprocal’ relation: xRy y= |l/x|. This is diagrammed in Figure 5. Since 
/?erun it is meaningful to write R{x), so we observe /?(-3) = 1/3. We can find Rx by 
following the arrow pointing from x or by looking down the left column for x and taking 
the corresponding element from the right column 

The concepts of left univalence and bi-univalence are defined analogously: 

/?€lun <=> Vxyz [ yRx A zRx d y ] 

/?ebun <=> /?elun A /?erun 

Bi-univalent relations are also called bijections and one-one mappings. 
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1 


2 


2 


3 


3 


4 


4 


5 







Pigure 4. Function Viewed as a Table 

4.2 higher level functions 

Of course, the converse of a function is not necesseirily a function. The 'sin' relation, 
defined so that x sini/ means that y is the sine of 2 , is right univalent but not left 
univalent. Hence, we can write either T/ = sin 2 or x sin y, but can express the arcsine 
only by y sin“* x. The notation sin~* y is meaningless. Since / (x) is meaningful only 
when / erun we will be careful to write f (x) only when we have previously shown (or it 
is obvious) that / erun and x Lm / . 

The fact that F{x) may be meaningless makes it convenient to use several other 
relations derived from F. One of these is the image. If F is any relation and C is a 
class then img F C is the set of all y such that xF\/ for some x in C, i.e., 

img F = \C:z \ z = \y\ '^{xFy A xeC)|| 

The tabular interpretation of imgFC is shown in Figure 6. We see that, if F is any func- 
tion, then imgFS is the image of the class S under that function. Notice that the 
operation imgFS’ is defined for all relations F and classes S, regardless of whether 
Ferun or the members of S are left members of F. For these reasons, it is generally 
Self er to write imgFC than Fx . 

The image operation is also useful for working with relations. For exam.ple, 
img.inv(<)S is the set of all numbers that are less than some element of S. The rever- 
sal of the sense of the ordering occurs because img(>)S is the set of all y such that for 
some xeS', x>y . Thus img.inv(<)S' is the set of all x such that for some y^S , x<y . 

Related ideas are the image and converse image of an individual. If F is a relation, 
then c = unimgFx means that c is the class of individuals related to x. This class is 
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1 


1 


-1 


1 


2 


1/2 


-2 


1/2 


3 


1/3 


-3 


1/3 







Figure 5. Right-Univalent Relation Viewed as Table 
called the unit image of x, and is defined unimg R x = | xRy\. Alternately \ie can 

define unimgRx = imgR\xl. 

The converse idea is that of the inverse unit image of y : 

unimg. inv R y - \x\ xRy\ 

Like the image, unimg/? and unimg. inv/? are defined for all R and all arguments. 

We can also apply the unit image operations to general relations. Therefore 
unimg. inv(<):c is the set of all numbers less them x. This is sufficiently common that 
we define all=unimg.inv. Then alKx is the set of all numbers less than x. 

Next consider the function all(=): 

all(=)a: = \y\y=x\ 

Hence. all=x is just the unit class containing x, which we will abbreviate this unx. 
Conversely, if C is a single element class, then un“*C selects the unique member of 
that class: un“*C = tx(xeC). It is thus a uniqueness filter. We will write this as iJC 
where i5 = un“*. The expression i9C can be read ‘the C.' 

EXERCISES: Show the following; 

unimg = unimg, inv/? 
unimg. inv /?■* = unimg/? 
unimg/?!/ = img/?(uny) 

It is often convenient to have names tor domain extracting functions, e.g., dom R is 
the class of left members of /?, These are simply defined using images; 
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Figure 6. Image Operation Applied to Tables 
dom = all Lm 

dom.inv = all Rm 

mem = all Mm 



Of course the left members (domain) and right members (domain-inverse) of a relation 
can be obtained by taking its left and right columns, respectively, and deleting dupli- 
cates (Figure 7). 

R 



1/ \ 

dom i? dom.inv/? 

Figure 7. Domain Extracting Operators 

5. Boolean Operations 



5.1 logical connectives 

We will next investigate ways of combining relations and classes. The simplest 
methods are just abstractions of the logical connectives used between propositions: 
Therefore, we define the intersection, union, negation eind difference of classes and 
relations: 



x'Z.{Sr\T) -se- xe5AxeT 
xe(5 u T) xe5 V xeT 
xe(~5) -’(xe5) 
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xc{S\T) xeS ^(x^T) 

x^(SoT) xeSDxeT 

As an example of the use of these operations, consider our previous definition of Mm; 

z Mm R z Lm R \/ z Rm R 

Using the union operation this can be written Mm = Lm u Rm. Similarly, bun = lun n 
run. The logical connectives satisfy the usual properties of a Boolean algebra (e.g., 
DeMorgan’s theorem). 

As an example of the use of these operations, we will define the closed interval func- 
tion, [m. .n], which is the set of integers m, m+ 1 n. It is just: 

[m. ,n] = all>m n ali<n 

where < and > are the relations on integers. In general we will allow [m. .n] for an}^ 
types on which a strict order is defined. 

EXERCISES: Define the analogous notations {m. n), [m. .n), and {ra..n\ 

5.2 empty class 

It is useful to have a name for the empty class: ^ = 5\5, for any set 5. Hence, 
xe0 is always false. This is most often used for stating properties of relations and 
classes. For instance, S r\ T = (f> means that classes 5 and T have no members in 
common. 

The universal class is also useful: O = ~0. For instance, S u T = O means that 
every individual is either a member of S or of T. Notice that the class of the right 
members of a relation is just the image of the universe under that relation, i.e., 

dom.invT? = img/?0 
dom R = img/?”'0 
mem R = img(A’ u R~^)0 

EXERCISES: Prove these properties of the domain functions. 
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5.3 Cartesiaii product 

It is often useful to have the maximum relation that can hold between two classes, 
i.e., the Cartesian product of those classes. This is defined: 

Sy.T = [x-.y IxeS’AT/eT'j 

EXERCISES: Show the Cartesian product satisfies the following properties; 

(sxf)“* = fxs 
dom(sxf) = s 
dom.inv(sxf) = t 
mem(sx^ ) = s u f 

sx(fnii) = (sxi) n (sxit) 
sx(f \j u) = (sxf) u (sxii) 
sx(t \u) - (sxt) \ (s X ~u) 
sx(t Du) = (s X u (sxu) 

SX0 = (pxs = 0X0 = 0 
sxt = (sxo)r\(Oxt) 

5.4 subset relation 

Finally, we define the subclass operation: 

ScT Vx(xe5 3xeT) 

EXERCISES: Show the following are true: 

sQt D {sxu)c.{txu) 

s<zt D {rxs)c.{rxt) 

sQt /\uQu D (sXl/,)c(f xu) 

6. Limiting emd Restriction 

It is often useful to limit the left or right domain of a relation. Consider the relation 



- 11 - 



RELATIONAL PROGRAMMING 



y sin ’ X, which means that x is an arcsine cA y . Wc cannot write x = sin“'y because 
sin“* is not right univalent (i.e. it is not a function). If we restrict y, the eirgument of 
sin, to the range -tt/4 to tt/A. then there is a unique x such that y sin“* x. Let S be 
the class of reeds in the range — rr/4 to tt/4; 

S = (—rr/4. .tt/4] = all>(-rr/4) n all^(7r/ 4) 

then we will write 5'-»sin for the sine function with its arguments restricted to S . This 
function is bi-univalent, so it is invertible. If we call the inverse of this restricted sine 
Arcsin; 



Arcsin = (^-^sin) ' 

then it is perfectly meaningful to write Arcsin x (if x Rm sin). The left-restriction 
operation is defined: 

x{S~*R)y <=> x^S/\xRy 



In other words. 



y - {S-*R)x y - Rx A xg5 
The right-restriction is defined analogously: 

x{R^S)y <=> xRy /\y^S 

These notations can be combined to restrict both domains: 



X {S ^R*-T) y -6^ xg5 A zRy /\y^T 

The combination s^R<^s is so common that a special notation is provided for it: 
R'ts = s-^R<-s. For instance, <tF, where xgP <»x>0, is the less-than relation res- 
tricted to positive numbers. Notice that x succ y if and only if y is the successor of x. 
Therefore we can define the sequence of integers (m,m + l, . . . ,n) by restricting the 
succ relation: 



[tti. .n] = all5;m succ all<n 

See Figure 8 for a tabular representation of the domain restricting operations. 
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Figure B. Domain Restricting Operations 

EXERCISES: Show that the restriction operations can be defined in terms of inter- 
section and Cartesian product: 

s^r<-t = r n {sxt) 
rts = r n (sxs) 
s->r = r r\ (sxO) 
r^s = r n (Oxs) 

EXERCISES: Show that other properties satisfied by these operations are; 

sxt = s 0x0 
dom(s-^r) = s n dom(r) 
dom.inv(r <-s) = s ndom.inv(r) 
dom(r«-s) = img(r~‘)s 
dom.inv(s->r) = img r s 

{s 

r<-s nr^t = r^{s r\ t) 
r*-s ur<-t - r<-{s u t) 

{rxO)<-s = rxs 

7. Relative Product and Composition 

If xsony is the relation 'x is a son of y' and xbrothen/ is the relation ‘x is a brother 
cf y\ then the relative product, ‘son| brother’, is the relation 'a son of a brother of. 
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More formally, 

/?|5 = lx :2 I 3y(x/i)/ /\ ySz)i 

We will also write S./? and for A" 1 5. The reason for this is that if F amd G are func- 
tions it is easy to see that F. G is the composition of these functions: 

z = F.G X X F.G z 
«=> xG I Fz 

3y{x0y /\yFz) 

<=> 3y[z=Fy /\y = Gx] 

•$=> z=F{Gx) 

Hence, F.G X = F{Gx). 

It is convenient to have a notation for relative products of a relation vdth itself. For 
instance, the 'grandparent' relation can be written ‘parent | parent’, which we abbrevi- 
ate parent®. In general, 

F° = (=)t(mem R) 

= R 

+ i = (/?")|F = F1(F”) 

R-n ^ (/?")-! 

EXERCISES: Show these obvious properties of the relative product; 

(r.s).i = r.{s.t) 
r. (s u f ) = T.s u r.< 

{r yj s).t = r.t u s.t 
r.{s r\ t) c T.s r\T.t 
(r n s).t Q r.t n s.t 
5(r.s) 3(dom.inv r n dom s) 
where 3r means 3x[xe7-] 



- 14 - 



RELATIONAL PROGRAMMING 



(r *) * = r 

(r.s)-i = (s-‘).(r-i) 

rjr.m J.n _ <y,m+n (^jn ,n>:0) 

(r^)T^ _ ^mn ( 771 , 7x>o, or rebun) 

,r'^ C (rebun) 

T.r~^ — r'^r = r^ (rebun) 

dom(r.s) Q domr 
dom.inv(r.s) c dom.invs 
Lm = Rm | inv 
Rm = Lm | inv 

r. (0x0) = (0x0). r = 0x0 

r. Id = Id.r = r where Id = (=) (the identity function) 



8. Structures 

We have previously seen the use of arrow diagrams to represent a relation. For 
instance, the diagram in Figure 9 represents the relation R shown in Figure 10. 




8. 1 initial and termincil members 

Now, notice that the domain (left) and codomain (right) members of R are; 

dom R = ^ a. b. c. d. e, f, g { 
dom.inv R = ^ g. f. e. d, i, h | 

We define the iixitial members of R to be those members which are not pointed at by 
an arrow. Therefore, the initial members of R are the left members that are not right 
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R 



a 


A 


b 


f 


c 


e 


d 


d 


d 


e 


e 


i 


f 


f 
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i 


R 


f 




h 



Figure 10. Tabular Representation of a Structure 
members, that is, the domain members that are not codomain members. 

init /? = dom(/?) \ dom.inv(/?) = ^a. b, cj 

The terminal members of a relation are defined analogously: 

term/? = dom.inv(/?) \ dom(/?) = \h, i] 

When a relation is used to represent a data structure, the above functions become 
important. 

For instance, a sequence is represented by a relation with the structure: 

S — fij fig (I 3 

> 

In this case init S is the unit class containing the head (first element) of the relation 
(i.e., tti) and term S is the unit class containing the last element of the sequence (i.e., 
a^). Similarly, ('^.init S)-^S is the sequence with its first element deleted: 

Hence, the following common sequence manipulation functions can be defined: 



a S = i3.init S 


first 


u S = i3.term S 


last 


ns = (~.init5)->5 


final 
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AS - 5<-(~.term 5) initial 

EXERCISES: Prove the following properties of these relations: 

a = cj.inv 
cj - a.inv 
A.inv = inv.n 
n.inv = inv.A 



More operations on sequences are discussed in the next section. 



As another example of the use of ‘init’ and 'term', consider the relation, represent- 




Plgure 11 . Relation Representing a Tree 

Then, aT is ‘a’, the root of the tree, and term T is jd, h, i, f, j, k], the leaves of the tree. 
The result is analogous for forests. { ^ 




Figure 12. Relation Representing a Forest 

Given the relation in Figure 12, the set of roots is init F and the set of leaves is term F: 
init F = {a.i.gi 

term F = {c,e,f.g.h,j,k.l.m,n.t.u.v.wj 
8.2 higher level operations 

The set of nodes whose parent is n is just imgF>i. For instance, the set of nodes 
directly descended from a root is 
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(img F).init F = {b.hj.o.p.r} 

The set of nodes that point to leaves is 

(img F~^).lerm F = (b.d.a.i.o.p.s.rj 

These operations can be used for obtaining the maximum and minimum of sets. 
Suppose ’<’ is the less'than relation on integers and S is some set of integers, say 
{3,5,9}. Then 




3 5 9 



Now note that init(<T5) = \3] and term(<T5) = \9]. Hence, if 5 is any set of 
numbers, then the minimum and maximum of this set are: 

min 5 = a(<T5) 
max 5 = cj(<t5) 

Notice that we can select the maximum and minimum based on any relation that is a 
series (i.e., transitive, irreflexive and connected). If R is any series then a(i?T5) is the 
minimum (relative to R) and cj(Rl'S) is the maximum. 

EXERCISES: Show that the following are properties of these operations: 

init r = term(r“^) 
termr = init(7'“^) 
init c dom 
term C dom.inv 
init(rTs) = term(r“^Ts) 

init(r us) c init r u init s 
init r n init s c init(r ns) 
term(r u s) c term r u term s 
term r n term s c term(r ns) 
init(sxi) = sV 
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term(sx^) = init(sx^) ' = init(^xs) = ^\s 

9. Sequences 

9. 1 ordinal couples 

In this section we will continue the discussion of sequences begun in the last section. 
We saw that it was easy to define the following operations on sequences; 

a 5 = T^.init 5 
o S = T^.term S 
AS = 5^('^.term 5) 

QS = (-.init5)->5 

This provides us with functions for taking sequences apart. We will define the ordinal 
couple or pair, which puts them together. If x and y are two objects, then '{x.y)' is 
the relation that relates x and y but no other objects. 

X y 

That is, u{x ,y)v if and only if u=x and y=v . This is formally defined by: 

x,y = \u\v \u^x/\v-y] - un(x) x un(?/) = un(:r: 2 /) 

Notice that x \y = Observe also that a{x ,y) - x and o{x ,y) = y. Finally, 

xRy {x,y)QR. 

Explicit relations can be described by a combination of the pair and union opera- 
tions. For example, we have the identity: 

\ X2.y2 ^n-Vn I = i^2.yz) U • ■ • U 

We will define a convenient notation for sequences of two or more elements; 

{ Xi. X2, . . . . x^) = \xi:xz. Xz:x3 

Therefore the sequence (a,b,c,d,e) is just 
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» X — > » ^ — >• 

abode 

9.2 catenation and consing 

If s and t are sequences then we can define an operation which is the catena- 

tion of s and t . To form this catenation we must hook the last element of s to the first 
element of t : 




Therefore ]i/ if and only M x s y , or x t y , ov x =cj s and y-a t . Hence, 

sH = s u {o s , a t) u t 

The catenation operation is only defined for sequences, which are required to have at 
least two elements (since an irreflexive relation with less than two elements is the 
empty relation). This issue is discussed in the following section. 

How then do we add a single element to the left or right of a sequence? The 'cons 
left' and 'cons right’ operations are easy to define: 




a: cl Si Sti ^ Sj s, 

xcls = (a:,as)us 
s cr y = s u {co s , y) 



If 5* is a sequence and xMmS, then Sx is the successor of x in 5 and S~^x is the 
predecessor of x in S' (if these exist). 



Sx = successor of x in S 
S“^x = predecessor of x in S 



These are convenient ways of moving around within a sequence. Also, note that if s is a 
subsequence of t thenscf . Some additional identities are: 
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{x,y)\{y,z) = (x,z) 

itng a (5 XT’) = S’ 
img a (SxT) = T 

EXERCISES: Show that if s is a sequence, then: 

a{x cl s) = X 
n(x cl s) = s 

q(s cry) = y 
A(s cr y) = s 

(a s) cl (n s ) = s , if size s > 2 
(As)cr(ws) = s, if size s > 2 

Also, if s is a sequence, show that s u (cj s , a s) is a ring formed by joining the last ele- 
ment of s to the first element. 

If s is a sequence, then s“* is the reverse of s. Hence, revs = s~L Show the follow- 
ing; 



as -us 


-1 


us = a s 


-1 


As = (Qs *) * 


Os = (As“*)“* 


(s-0-‘ = 


t ^ '^S ^ 


(x cl s)“* 


= cr X 


(s cr x)“' 


= X cl 


(x,y)"> = 


(y-^) 


( 3 ^ 2 . • • 


1 

I 

II 
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|x,:T/,, Xz.yz. ■ . ^n Vn\ ' = \yi-X\~ VZ-^Z Vn'^n] 

9.3 alternative definitions of sequencers 

We will state the formal definition of a sequence: a relation is a sequence if it is a 
connected irreflexive bijection. That is, 

sequence = connex n irrefl n bun 

seirrefl c 

sEconnex dom s = init s u dom A dom = term s u dom s 

Although the preceding definition of sequences is very convenient, it has a number of 
limitations. For example, the operations discussed above are only defined for 
sequences with two or more elements, since an irreflexive relation cannot relate less 
than two elements. In particular, (2) = 0. One solution to this problem is to use a stan- 
dard “end marker" for all sequences, say ‘EOF’. For example, the sequence 1, 3, 5 
would be represented by the relation (1,3, 5, EOF). A one element sequence containing 3 
would be represented by (3, EOF) and an empty sequence by (EOF) = () = 0. This 
definition has some curious properties of its own. For example, the relation 
(3, 3, EOF) = ^3:3, 3:E0Fj has no initial members and in fact is not a sequence (since it’s 
not irreflexive). Of course this objection also applies to our original definition of 
sequences. 

A different solution is to extend the definition of sequences so as to allow length one 
sequences by making the relation reflexive. 

s u (= t mem s) 

Si S2 S3 S„ 

The one element sequence is then; 

0 

So - (sq. So) 
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This still does not solve the problem with repeating elements in sequences, however. 

An alternative definition of sequences is based directly on the pair-making opera- 
tion. Define <> to be some distinguished value 'niL. Then define 

<Xi,X2 Xn> = Xi:<X2, Xj,> 

¥e can see that 



<Xi,X2 Xn> = xi:x2:...:xj^:ml 

(We have assumed is right-associative in the above equation.) This is essentially the 
way lists are represented in LISP. A more comprehensive solution to these problems is 
discussed in Chapter 16, Data Structures. 



10. Binary OperatioiLS 
10, 1 basic concepts 

In this section we will discuss our approach to binary operations — that is, to func- 
tions with two arguments and one result. We have already seen how unary functions 
are connected to relations. For instance, we can write the fact that y is the sine of x 
by either xsiny or y = sinx. Since we only deal with bineiry relations, we will have to 
have a new convention for handling binary functions. This convention is: we \nW com- 
bine the two arguments of an operation into a pair. For instance, w’^e can define a rela- 
tion 'sum' such that (x,y) sum z if and only if e is the sum of x and y . More formally: 

sum = \a:z \ 3x ,y[a = {x ,7j) /\ z =x -^y]\ 

We can use our function application convention as usual, e.g., 

z = sum(x.y) (^.y) sum z 

Now, it would be inconvenient to have to invent names, such as 'sum', for each opera- 
tion, such as ' + Hence, we will adopt a systematic convention for making such names: 
placing the conventional infix symbol for the operation in parentheses or other brack- 
ets. For instance. 
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(x.T/)[ + ]z <=> Z = [ + ](x,y) <-i> 2 =X+1/ 

In fact, if 7T is any infix operation symbol, we will explicitly define its meaning by 

xTry = [7r](x,y) 

This notation will permit us to meuiipulate in a more regular fashion the usual arith- 
metic operations (+, -, x, /) as well as the relational operations (e.g. n, u, t, x). 

We omit the brackets when the meaning is clear without them. For instance, if S is a 
class of classes, then 

img n {Sx.S) 

is the class of all pedrwise intersections of members of S . 

10.2 operations on binary operations 

It is often convenient to be able to generate simple relations from a binary opera- 
tion. Following Russell and Whitehead [WhiteheadTO], let n represent any binary opera- 
tion. We define: 

(rry) = \x:z \ xny = z J 
(xtt) = ly:z | xny = z| 

Hence, x{—l)y y =x-l, therefore (-1) is the predecessor function. Similarly, 
x(l+)y ^ y = 1+x, therefore (1+) and ( + 1) are both the successor function. These 
can be used as functions: (-l)x = x-1 and ( + l)x = x + 1. 

This convention makes it very easy to form more complex functions. For instance, if 
wewant/(x) = sin(l/x) then we can define /= sin. (1/). To see that this works: 

/(x) = [sin.(l/)]x 
= sin[(l/)x] 

= sin[l/x] 

Again, we omit the brackets when the meaning is clear from context or can be made 
clear by spacing. Furthermore, we adopt the convention that if two binary operators 
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occur together, then the first is taken in its unary sense and the second in its binary 
sense. For example, sxuf means [sx]uf , not sx[ijf]. When a binary operator is used 
in its unary sense, it will be taken to be very binding; that is, f xu means /,[a:u], not 

Now observe the action of the {x,) and (,y) functions: 

{x.)y = {x.y) 

{.y)x = {x.y) 

Therefore, for any binary operation rr (except we can define 

^ = [^](.i/) 

XTX = [7r].(a:,) 

Let’s see why this works: 

{xTT)y = [(7r).(x,)]y 
= (7r)[(x.)y] 

= 

= xiry 

The form {ny)x is analogous. In general, if / is a binary function, then /. (x,) and 
f.{,y) are the "partially instantiated" unary functions. This is the effect of Curry and 
Feys 'B' combinator [Curry 58]. 

Since S~^ is the reverse of a sequence, rr.inv is the reverse form of an operation. 
For instance. — .inv is the reverse subtract operation: 

-.inv(x,y) = -(inv(x ,■?/)) 

- y-x 

Thus —.inv can be read 'subtract from' and / .inv can be read 'divide into'. This is 
Curry and Feys 'C combinator (see the next section). 
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1 1 . Com binators 

11.1 paralleling of relations 

In this section we will discuss several powerful operations for manipulating relations. 
These are called combinators because of their similarity to the combinators of Curry 
and Feys [CurrySB]. 

The first combinator we will discuss is the paralleling of relations. R\\S, which is 
defined: 

{u,v)R\\S{x ,y) <=> uRx /\vSy 

So, if / and g are functions, [f\\g]{x,y) = [f (x), g{y)]. Hence, f \\g is the element- 
wise combination of / and g. For example, if we want f {x ,y) = sin x + cosy, we can 
write / = +.(sin||cos) since 

/(^.y) = [L.(sin||cos)](x,y) 

= +[(sin||cos)(x,y)] 

= +[sin X, cos y ] 

= sin X + cos y 

11.2 coaditional union 



The restriction operations allow us to define the very useful conditional union or 
overlay operation [MacLennan75], R.S = R <J ~.domi? ^ S. In other words, the value 
of (R-,S)x is Rx if aredom/?, and Sx otherwise. This has many uses. For example, if / 
is a peirtial function, then /;ld is the extension of / to the identity function. That is, 
(/;Id)x is fx if that is defined and x otherwise. 



The conditional union is useful for defining conditional-like structures. For example 
p ->/ -,g is a function that applies / if p of its argument is true, and applies g otherwise; 



b -»/ >9 



fx if px 
gx if -px 



(This assumes pcdom/ . Why?) Therefore we have the equivalent of Backus' conditional 
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combining form p-^f \g . 

The overlay operation is also useful for updating functions representing tables. For 
example {i,x)\T is a table just like T except that Ti is now x, regardless of whether Ti 
was defined or not. Similarly, is a table in which all the entries of S have been 
added to T, possibly replacing corresponding elements already there. 

11.3 combinatory logic 

One of the simplest combinators described by Curry and Feys is the elementary can- 
cellator, K, defined so that Kr is a function such that (Kr)^=x for all y . That is, K gen- 
erates constant functions. Since Kz is a relation that relates x to everything, we can 
define it: K= (Ox).un, where un == is the unit class generator. To see that this 
works, note that 

Kr = Ox . un X = Ox(un x) 

and therefore that 

u{Kx)v [Ox(un x)]ij 

<=> TxeO A i;eun(x) v=x 

Therefore, for arbitrary li , {Kx)u = x. 

Another combinator is the elementary duplicator, W, defined so that 
(W/ )x = / (x.x). If we define A x = (x ,x) then it is easy to see that W/ is just /. A. For 
instance, x.A is the squaring function: 

x.An = x(Ati) = x(ti,7i) = nxn - n^ 

It should be clear that Backus’ [/ ,5] combining form is just our (/ ||g).A, since 

(/|i3).Ax = /llg (x,x) = ifx,gx) 

Since this combination is so common we will adopt a special notation for it: 
f ~9 - (/llp) A. Hence, {f~g)x = {fx,gx) 
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EXERCISES: Show that some of the properties satisfied by these combinators are: 

iR\\Sr = (A")||(S-) 

(r;s).t = {R.T):(S.T) 

(/?||5).(77ff) = (R.TYXS.U) 

ER = R~R 

{R\\S).mv = (5||/?) = inv.(y?||5) 
inv.(i?75) = SRR 
a.(R:S) = (dom5)-»/? 

— (dom R)-*S 
f?||5 = (/?.«):( 5. CJ) 
cl = (oTG)-* 
cr = (AT<y)"‘ 

EXERCISES: Show that / = +.(x.A ; 2x) is the function / (0 = ^^+2/1 . 

The formalizing combinator, $. is defined so that [<i>/(a,b)]x = / [a(x),6 (x)]. It is 
easy to see that $/ (a, 6) = f.(a~b). For instance, 

/ = $ + [x.A, 2x] 

is just the function f (t) = t^+2t . This can be written directly using the notation of our 
relational calculus: 

/ = +.(x.AT2x) 

The combination $rr[/ ,p] occurs very frequently. Therefore, we define rf = $rr to be the 
formalization of the operator n. Notice that [/Wp]x = {fx)n(gx). In particular, the 
function ft = t^+2t can be written 

/ = x.A T 2x 

In general. n.{f~g) = frig 
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Another combinator is the meta-application operator. which corresponds to Curry 
cind Feys' S combinator; (/®p)x = (/x)@(px). For instance, img®init is the operation 
that gives the set of descendents of roots of a forest F, since 

(img®init)F = (img /’)@(init F) = (img F’).init F. 

Another combinator defined by Curry and Feys is the ^ combinator: 

= f[9ix),g{y)] 

This is simply defined by ^(/,p) = / (pIIp)- Therefore, if / = '!'[ + , x. A] then 

f (x ,y) = x^+y^. This can also be written / = x. A.a T x. A.a>. 

11.4 Curried functions 

A function / is called a Curried derivative of g if fxy = g (x,y). We define operators 
‘curry’ and 'uncurry' such that / = curry y and = uncurry / . First consider 
uncurry: 

(uncurry/ )(x,T/) = g{x,y) 

= fxy 

= {fx)@y 

= @.[/ ||ld](x,T/) 

By canceling [x.y) from both sides we see that uncurry/ = @.[/||Id]. If we wish, we 
can factor / out of this expression in this manner: 

uncurry/ = @.[/||ld] 

= [@][/l|ld] 

= [@][l|Id]/ 

Hence, uncurry = [@.].[||ld]. 

Next consider Currying. One solution is to simply define curry = uncurry^'; we can 
learn more however by defining curry directly. Suppose we are given a Curried pair- 
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making function ’rr’; nxy = {x ,y). Then, 

fxy = {curry g)xy = g{x,y) = g{-nxy) = g{[nx]y) - g.[nx]y 
Therefore, canceling y from each side we gel; 

curry pa: = p. (rrx) = [p. ](7rx) = [g.].nx 
Hence, curry p = [p. ].7 t. 

As a final example we derive Curry and Feys' C combinator; Cfocy = fyx . Observe: 
Cfxy = fyx = ify)@x = [@x](/p) = [@x].fy 
Therefore, by canceling y we get: 

Cfx = [@x].f - [./][@x] = [./].[@]x 

Hence, C/ = [./ 

EXERCISES: Show the following properties satisfied by these combinators: 

(Kr)./ = dom/^Kx 
/.(Kx) = K(/x) 

C = curry, [.inv]. uncurry 

12- Records 

12. 1 basic operations 

By a record we mean a finite function whose domain is other than a contiguous sub- 
set of the integers. For example, the following relation might represent a personnel 
record: 

R = \ name : "Don Smith", age: 40, 

hire-date: {mo:"Aug", dy:31, yr;19B0^, salary: 40000 [ 

The selectors 

name, age, hire-date, mo, dy, yr, salary 
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might be the strings ’'name”, ’'age”, etc. or the integers 1. 2, etc.. We are not con- 
cerned with their exact nature so long as they are distinct. A field is selected by apply- 
ing the record to the field’s selector: 7?(age) = 7?@age = 40. Thus ‘7?@age’ is analogous 
to Pascal or Ada’s ‘7?. age’. Next we will consider how records can be manipulated using 
the relational operators. 

Notice that if is a record of default values (say, for a personnel record) and 7? is a 
record providing values for only some of the fields of a personnel record, then R\D is a 
complete personnel record with defaults from D provided for the unspecified fields of 
7?, If 7? and S are records with disjoint selectors (or with overlapping selectors %\rhose 
values agree) then RuS is a join or combination of these two records. Finally, if 5 is a 
set of selectors, then S-*R is a subrecord of R containing only the fields whose selec- 
tors are in 5. For example, 

\dige, salaryl^T? = \ age: 40, salary: 40000 ] 

12.2 functional records 

A common situation is to apply the same function/ to every field of a record R . For 
example, we might want to negate the coordinates of a two-dimensional point 
P = ^X:10,Y:30j. This is easily accomplished by [0-].P. Therefore P’s Y coordinate is: 

[0-].P @ Y = [0-](PY) = [0"](30) = 0-30 = -30 

In general, we can see that the cp field of /.7? is / (R(p)\ f R@ ^ - f {P^)- 

Nov/ suppose that we have a record F whose fields are functions / i, /s. • • . /n 

^ = Wl-f 9n'fn\ 

We want to compute a record R that has the same shape as F, but with fie’ds whoso 
values are fiX, for a given x: 

R = Wvf • . <Pn'fn^l 

Therefore, for any selector <p, 
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Rtp = Fcpx = {Ftfi)@x = [@x](/'V) = [@x \ R 9 

Hence, R = We define ® Lo be the application of a functional record to an argu- 

ment, F®x = @x.F. Notice that F@ is the function derived from the functional record 
F. Further applications are discussed in §14, on arrays. 

We have seen how f.R applies a function to a record argument to yield a record 
result and F@x applies a functional record to an argument to yield a record result. 
Next we will investigate the application of a functional record to a record argument to 
yield a record result. In the simplest case F and R are the same shape and we want to 
apply corresponding elements of F to corresponding elements of R to yield 
corresponding elements of the result. Thus, if we let 5" be the result record, then for 
any field cp: 

Stp'=- {F(p)@{R(fi) = {F®R)cp 

using the meta-application operator Hence, S = F@R . Therefore, v/e cam apply 
corresponding elements of F to corresponding elements of R by F^R . 

For an example of this operation, suppose that we have the personnel record R 
defined in §12.1 and that we want to compute a new record S in which the age field of R 
has been incremented and in which the salary field has been increased by 10%. We can 
accomplish this by S' = F®R, where 

F - ^name; Id, age; 1 + , salary: l.lx, hire-date: IdJ 

A common situation is to update one field and leave the rest unchanged. The R record 
with its age field incremented is just [age,l+; Kid]©/?. 

We next consider a generalization of meta-application: the outer product of a func- 
tional record and its eirgument record. Suppose we have a record F of functions and a 
record R of arguments; the records F and R are not assumed to have the same shape 
(i.e., the same domain members). We define S = outer/'’’/?, the outer product of F emd 
/? to be a record with the same shape as F, each of whose fields has the same shape as 
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R. That is, if 9? is a field selector of F cind ^ is a field selector of R , then S(p is a record 
in which S(pi^ is the result of applying F(p to R'ljj. That is, S^p - {F(p).R , since {F(p).R 
applies F<p to each field of R and forms a record of the results. Therefore, 
S(p = {.R){F<p) = .R.F(p. This yields the definition of the outer product: 
.R.F = outer/7?. Further applications of this operator will be discussed in §14, on 
arrays. 

EXERCISES: Define an outer product that yields the transpose of this result. That 
is, S(p'\p = {Fij){R(p). 

12.3 relational databases 

Next we consider databases composed of record-sets and define functions that ai^e 
analogous to the relational operations of Codd [Codd?0]. Let be a record set whose 
elements have the selectors {name, age, hire-date, salary]; D might represent part of 
ein employee database. Observe that if / is any operation applicable to a record then 
img/ is a corresponding function applicable to the entire record set. 

For example, to form o. projection composed of just the ‘age' and ‘salary’ fields of D 
we write img[{age, salary] ^]Z^. To compute D' in v/hich every employee in D has been 
given a 10% raise, we can write 

D' = img [({salary; l.lx]; KId)@]/7 

In other words, we are applying a function to each record in D\ this function multiplies 
the 'salary' field by 1.1 and leaves the other fields intact. 

Often we v/ant to choose some selector cp of the records in to be a key and gen- 
erate a function F from D such that Fk is the record in D whose <p field is k . Y/e v/rite 
this F - indexcpD. Observe: 

r - Fk <=> T(p = k /\rcD 
r@ <p = k /\r cD 
<=> [@ 9?]r - k /\r cD 
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<ri> r - [@yj] 'fc /\r£D 
<=> r = {[@(p]~^ D)k 

Hence, F - [@^] ^*~D, so we define indexcpD = [@ 9 ?] ^*-D. 

Another common operation is selection. For example, suppose we want P to be the 
set of all records in D whose 'age' field is greater than or equal to 65. The first step is 
to index the set on the age field: A = index age D. Notice that xAr if and only if record 
r from D has an age field equal to x. We can think of A as a multiple-valued function 
that takes ages into the records having those ages. Thus, if we apply imgA to a set of 
ages then we will get a set of all the records that have ages in the given set. Clearly, 
then 

P = imgA(all>65) = img[index age i?](all>65) 

This leads to a general definition of the selection function: 

selecti/)i? = img[index 5 ^T?] 

Hence, selectjs = img.(indexi; 5 ), so select = (img.). index. With this definition of select 
we can write 

select age D (all>:65) 

to select all those records whose age is greater or equal to 65. 

Finally, we consider the join of two record sets D and E, ]o\n(p{D ,E) . This is com- 
posed of of records formed by combining all those records from D and E whose 9 ? fields 
are equal. To accomplish this, first index D and E on their tp fields: F = indexr/pZ?, 
G = indeX(;«i£’. Let k be any value of the field 93 ; observe that (F||G)A: is a pair (ri,e) 
where dcD. e^E and d and e both have their 93 fields equal to k. Therefore, vie v/ant 
due to be in the join. The set of all such pairs (d,e) is just the range (dom.inv) of the 
relation F\\G. Therefore, to get the join J we must apply the union operation to every 
record pair in the range of F||G: 

J = imgu(dom.inv[F||G]) 
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This is the definition of the join operation. We can factor D and E out of the definition 
thus; 

ioin.(fi{D,E) - [imguj.dom.inv A’llG 

= [imgu].dom.inv[indexi;sTi||index95£’] 

= [imgu].dom.inv,||.(indexi;sZ>,indexi;s£') 

= [imgu].dom.inv,||. [indexes || indexi;s ](/?,£’) 

Therefore. 

joints = [imgu].doni.inv.||.[ indexes || indexes] 

EXERCISES: Factor out of the definition of join. 

13. Ancestral Relations 
13.1 definition 

Carnap [Carnap5B] defines the relation of a property p being hereditary vath 
respect to a relation r : 

p Her r Vary \x^p /\ x r y D y^p] 

<-> img[r“^]p cp 

This leads to the definition of the ancestral of R of the first kind as that relation which 
preserves all the hereditary properties of R. This is also called the reflexive transitive 
closure of R: 

xR *y X Mmr A Vp [p Her A a: ep 3 y ep ] 

For example, if xPy means that x is a parent of y . then xP^y means that x is an ances- 
tor (or the same as) y. The ancestral of the second kind or transitive closure is also 
useful: 

/?+ = R^\R zz R,R* 

Thus, means ‘ancestor’ in the colloquial sense. The easiest way to visualize the 
meanings of the ancestrals is by their expansion as infinite unions: 
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R" = u /?' u (J u ■ • 

R* = /?' u u /?3 u u • ■ 

EXERCISES: Here are some useful properties of the ancestrals. Prove them. 

R^ = R\{=) = 
xR*y 3n.[n>0 A 

R°ZR' 

/?” C /?*, for n>0 
/?" C R^. for 7i>0 
= R^ 

R^ C /?* 

/?' = R\R" 

R* = 

(/?*)-* = (/?-')* 

(y?-)- = {R-r 

(rts)* c r*ts 

Ancestral relations are always transitive. Notice that < and < for integers can be 
defined; 

^ = ( 1 +)* 

< = ( 1 +)" 

That is, x<y means that y can be reached from x by zero or more applications of the 
successor function (l+). The ancestral ‘‘fills out“ all of the paths in a structure. For 
instance, if 

/? = a, ag ttg 

• > »■ > « — ** 



then 
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13.2 applications 

Suppose that S’ is a sequence and we wish to find the first member of S v/hich 
satisfies some property P. First form the closure so that for any two members of 

S* we can tell which is first. Next, eliminate from any members that do not satisfy 

P: S^fP. Then, a{S^fP) is the first member of S satisfying P. 

Next we will consider a simple character manipulation example; stripping leading 
blanks from a string. Note that x {y cl)' 2 means that z is a result of consing 0 or 
more y's on the front of x. Hence, z [(y x means that x is the result of strip- 

ping one or more y's from the front of 2 . To get the desired result it is only necessary 
to restrict the left domain of this function to be sequences that don't begin with a y . 
Suppose Y is the property of beginning with &y. 

xcY <-p y= a X <-> x ay <=> xe allay 

Therefore, the function to strip leading y's from a sequence is; 

[{y cl)']“* «- ~.(alla)y 

13.3 iteration 

Before we leave the topic of ancestral relations, it will be useful to investigate their 
use as a means of iteration. Suppose that A’ is a function (i.e., right univalent). Then, 
since 

f* = \j V u ■ ■ ■ 

we will have xF*y if and only if for some n>0, y = F^x. In general there may be many 
such n, so F^ may not be a function. If F^ is to be a function, it is necessary to pick a 
termination condition (a class) that is only true for one of F^x, F^x, F^x, Therefore 
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consider the relation F^<-~.domF. Let D - '^.domF to see the eflect of this function: 

* * • )<^D = {F^D u F^^D u D u • • ) 

Then {F^^D)x is F^x where n is the unique n>0 such that F^x is defined but F^^^x is 
not. This n is unique because F^^^x undefined implies that for all 7n>n F^'‘x is 
undefined. This leads to the definition of iter F: 

iter F - F^«-^.dom F 

Notice that iter[P->F] will iterate the application of F so long as its argument satisfies 
P (and is in the domain of F). Since it always applies F at least once it is not lil-ce a 
while loop; the equivcilent of the while loop 

while - P do P 

is iter[P“>P];ld, since any input not in P n domP will be passed through. Hence we 
define whUe[P,P] = iter[P->P];ld. Analogously, P|while[P,P] is equivalent to 

repeat P until - P 



14, Arrays 

14. 1 definition and basic operations 

An array is just a function from a contiguous subset of the integers to some set of 
values. If A is an array and i € dom A then A{i) is the i-th element of A. Similarly, if 
I C dom A is a set of index values then imgAf is the corresponding set of array values 
and /->A is the subarray of A selected by those indices. 

It is easy to define multi-dimensional arrays: they are just arrays whose elements 
are selected by sequences of integers, e.g. If M is a two-dimensional arra 3 ^ 

then M. (i,) is the i-th row of M and M. {,j) is the j-th column of M . Also, if / is a set of 
row indices and J is a set of column indices then /xj ^ M is the submatrix of M 
selected by these sets. It is easy to see that M. inv is the transpose of M , since 

i/. inv(i.j) = M[inv{i,j)] = M{j,i) 
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More generally, if F is a permutation function (i.e. a bijection from an index set into 
itself) then A.P is the result of permuting A by P. 

APL-like array and matrix operations are easy to express wth the relational opera- 
tors. For example, if A is an array, then f .A is the array resulting from applying / to 
every element of A. This follows from the definition of composition, {f.A)i = f{Ai). 
Hence, sin. A applies sin to every element of A. Conversely, if F is an array of func- 
tions, then F$x is an array of results obtained by applying each element of F to x. 
That is, (F’Sx)i = {Fi)x. Also, if F is an array of functions and A is an array of argu- 
ments, then F@A is an array of results obtained by corresponding elements of F to 
corresponding elements of A. This follows from {F@A)i - Pi @ Ai. 

Note that if A and B are two arrays with the same domain, then ATB is the 
element-wise sum of these two arrays. To see this, suppose that C = ATB and consider 
an arbitrary element of C; 

Ci = {ATB)i - Ai + Bi 

In general, if tt is an infix binary operation, then tt is the element-wise extension of that 
operation to arrays. If / is any binary function, then /. {A~B) is the element-wise 
application of it to arrays A and B. 

The same approach works for matrices and arrays of higher dimensionality. Sup- 
pose that M and N are two-dimensional matrices with the same domains. Then, 
f = /[it/(i,;)] and 

As for one-dimensional arrays, a matrix of functions can be applied to a single argu- 
ment by M®x, cind a matrix of functions can be applied to a matrix of arguments by 
M@N. 

If A and B are arrays, then C - f. (A||j9) is an outer product by f of A and B. since 
C{i.j) = f (M.Bj). For example. 
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x.([l..l2] ||[1..12]) 

is a 12 by 12 multiplication table. We can also form an outer product betv/een an array 
of functions and an array of arguments. If F is an array of functions and A is an array 
of arguments, then P = ® .(/’||A) is a matrix in which P{i,j) = {f\){Aj). 

EXERCISES: Prove that @.(F]|A) = uncurry. outer F A. 

Suppose X is an element of the array A (i.e., for some i, x=Ai). Then allA x is the 
set of all indices for which x=Ai. Therefore we can find the index of the first occurence 
of a: in A (i.e. APL’s iota operator) by min(allA x). In general, if P is some property 
(i.e. class), then imgA~’ P is the set of indices of all elements of A that satisfy P. A 
sorted reflexive sequence of these indices is just < T imgA“'P 

14.2 relation to sequences 

It is easy to convert arrays to sequences and vice versa. Suppose all the elements 
of A are distinct, then A^^ is a function that returns the index of an elem.ent of A. \Ie 
want to define a sequence S' such that xSy if and only if x proceeds y in A, i.e. the 
index of x is one less than the index of y . To put this functionally, we want to define 5 
so that y-Sx meajis that y is the successor of z in A, i.e., that the index of y is one 
greater than the index of x . 

y - Sx A~^y = A“'z + 1 
<=> A~^y = (l+).A“'x 
y -= A. (l+).A“'z 

Hence, 5 = A (1+).A~‘. Notice that this is just the image of the (1+) structure under 
the function A“'; 5 = A'*5(l+) (the ^operation is discussed in the next chapter). 

Next, we will consider the opposite process; converting a sequence to ein array. 
Suppose we have a sequence; 

S = gp 0-1 gg gg 
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We wish to convert this to an array; 



0 


tig 


1 




2 


dg 


3 


dg 



Thus, for each element Oi in the sequence, we must find its index i in the resulting 
array. If we can define a relation R such that R{Oi)=i then /?“* will be the array we 
seek. Now R(ai) is just the number of predecessors of Oj in S. That is, ug has no 
predecessors, so R{a-o) = 0; ug has two predecessors, so ^(^ 2 ) = 2, and so on. Since S 
defined an immediate predecessor relation, S* defines an ancestral predecessor rela- 
tion; 




Since xSy means z is a predecessor of y , y=Sx means 1 / is a successor of x. Thus the 
set of successors of any element a is then unimg5*a, and the set of predecessors of a 
is unimg.inv5'^a, e.g. 

unimg.inv5*a2 = luo. 

Alternately, unimg.inv5^a = all5^a is the set of all elements that beeir the S* relation 
to a. The size of this class is then the desired index; 

size(all5^ii2) = 2 

Hence, R{a) = size(all5^a), so R = size.(all5'^). Now, we know that A is R~\ so we can 
define the function saO which converts a sequence into a 0-origin array; 
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sa0 5 = [size.(all50]"‘ 

To produce a 1-origin array, the only alteration is: 

sa 5 = [size. (all5*)]”^ 

14.3 other cinray operation's 

Next we will consider the concatenation of arrays. If A is an array such that Ai = 
then we can write A : 

A = 2:a2, .... 

where m is the length of the array. Similarly, suppose that jP is an n element array, 
then the concatenation of these arrays is 

A cat B - , m.cim,] U \m-\-l\bi, .... 

We can see that A cat B = AuB‘ where B' results from B by shift its indices by m\ 

B' = . . . , m-^-n.bj^] 

How do we compute B'? Observe: 

B'i = B{i—m) = - B.{—m)i 

Hence, B' = B. (-m) and A cat B - A B. (~m), where m is the length of A. The 
length of A is just size.dom A, so 

A cat B - A D B. (-size.dom A) 

We will finish our discussion of arrays by investigating the generation of sorted 
arrays, Let 5 be a set of integers to be sorted, then is a structure which relates 

lesser elements to greater elements. Now if x is any element of the set, all[<T5']x is 
the set of all elements less or equal to than x. Thus size(all[<?5]x) = size, (all is 

the number of elements of S less than or equal to x. This is just the index of x in the 
sorted array we seek. Hence if A is the sorted array, iAx if and only if 
x[size.(all so A = [size. (all Of course this can be generalized to any 

ordering relation 
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15. Isomorphic and Homomorphic Images 
15.1 images 

Consider any relation R and any biunivalent function / . If we teike each node n oi R 
and replace it by fn we get a relation closely related to R called the image of R under 
/, symbolized f SR. 

a 



R f SR 

Figure 13. Image of a Relation 

It is easy to define f SR . Observe that if 5 = f SR then {fx)S{fy) just v/hen xRy. 
Conversely uSv whenever there are x and y such that xRy ,u=fx and v =fy . Hence, 

uSv 'Bxy [xRy A u = fx A v =fy ] 

3xy[xfu /\ xRy /\ yfv] 

3xy[uf~^x/\xRy/\yfv] 

u[f~'^\R\f]v 

Y\ence,fSR =/“‘|^l/ 

The image operation is also useful when / is not biunivalent. For example, if fb = fd 
then / SR (with the R in Figure 13) is; 

fe fb -fd 

I.e., we merge the nodes corresponding to b and d. 

The S operation is clearly related to the img operation — they both compute the 
image of a structure. Since -!?.[/ 1|/ ].un x:t/ = {fx)\{fy), we have this relationship 
between the images of relations and sets; 

fSR = img(T?.[/i|/].un)/? 
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That is. fS = img(i3.[/ 1|/ J.un). 

The image operations have many uses. For example, since [l..n] = (1,2, . . ,n), v/e 

can see that 

(m+)^l,.7l] = (771 + 1,771+2 771 +7l) 

Hence the identity [tti + L.tti+ti] = (77i+)S[1..ti]. To compute a list of the powers of two 
from 2° to 2’® we write (2 t)S[ 0.,16], where x7y = . Finally, to compute a list of the 

sines of the angles from 0“=0 rad. to 90 °=tt/ 2 rad. we write 

sin.(x7T/ 160) S[0..90] 

To see that this works: 

sin.(xTr/ 180) 5 (0,1 90) 

= ( sin.(x7T/ 180)0, sin.(x7T/ 160)1, . . . , sin.(x7r/ 180)90 ) 

= ( sln(0xrr/ 180), sin(lX7T/ 180) sin(90x7T/ 180) ) 

= ( sin 0°, sin 1“, . . . , sin 90° ) 

EXERCISES: Show that inv = [i?.inv,un]5. 

15.2 images of functional structures 

We have seen how, given a function / and a relation of values V we can form a rela- 
tion / SV in which the shape of V is the same as the shape of / SV and each member 
uememF corresponds to fv in f $V. Now we will address the converse problem: given 
a relation of functions F and a value v , how can we construct a relation F\v such that 
the shape of F is the seime as the shape of F\v and each member / ememF 
corresponds to fv in F\v. This is clearly the image of F under some unl<nown function 
f. F\v = <pSF. We will solve for 95 . Observe <pf - fv - f@v = {@v)f . Hence 93 = (@ 77 ) 
and F\v = {@v)SF. That is, F\v is the image of F under the operation 'apply to v‘. V/e 
can eliminate v from this definition: 

{F\)v = F\v = {@v)SF = {SF)i@v) = {SF).@ v 
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Therefore F\ - {SF).@. Notice that the F\ is a function derived from a functional 
structure just as F@ is a function derived from a functioned record. 

We now consider some applications of this operation. To form a sequence by apply- 
ing each of a sequence of operations to the same eirgument we write, for example: 

(sin.cos,tan)!i? = (sin'd, cos t 3. tani?) 

In particular, {f ,g)\ is just f~g and Backus' constructor [f ,g A] is just our 

[f '9 ^]!- 

Recall our previous example in which we computed the sines of the angles from 0° to 
89° by 

sin.(xrr/ 180) S[0..B9] 

We Ccin extend this to compute a table of the sines, cosines and tangents of the angles 
from 0° to 89° by using both of the image operations: 

(sin,cos,tan)!.(x7r/ 160) 5[0..89] 

This produces a sequence of sequences of the form 

((sin 0°, cos 0°. tan 0°), (sin 1°, cos 1°, tan 1°) (sin 89°, cos 89°, tan 89°)) 

In general F\$R has an outer structure the same as R's, each of the elements of v/hich 
has a structure the Scime as F's. Thus it is sort of an "outer product" between F and 
R in which the members are fr for / Mm F and r Mm R. 

To convert a sequence of sequences such as this into a matrix requires an applica- 
tion of the sa operator at each level of structure. Let S be the sequence of sequences. 
First convert each of its elements to an array by s&SS. Next, convert the resulting 
sequence to an array by sa[sa^5]. The result of the latter operation is an array of 
arrays that can be converted to a two dimensional matrix by uncurrying. Thus the 
sequence to matrix conversion is 

ssm S = uncurry. sa[sa55'] 
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Therefore, 

ssm = saS | sa | uncurry 

This can be read: To convert a sequence of sequences to a matrix, convert each of its 
elements to an array, convert the result to an array, and uncurry that result. 

EXERCISES: Define an outer product operation VFR which has the outer structure 
of F but the inner structure of R . Thus, the matrix corresponding to P/7? is the tran- 
spose of the matrix corresponding to F\SR: 

(ssm PF7?).inv = ssm{F\$R) 

15.3 isomorphism and the structure function 

Carnap [CarnapbB] defines two relations to be isomorphic when there is a 
biunivalent relation between their members that preserves their structure. That is, R 
is isomorphic to S: 

Rc^S 3/ebun[/? = f SS] 

Thus, two relations are isomorphic if one is a biunivalent image of the other. 
Equivalently, two relations are isomorphic if their arrow diagrams are equivalent when 
their node labels are removed. The isomorphism of sets is defined in the same way: 

<=> 3/Ebun[5 = img//] 

The structure of a relation is arrow diagram for the relation with its node labels 
removed. For example, the structure of R in Figure 13 is: 




Thus two relations are isomorphic if they have the same structure. Mathem.atically the 
structure of a relation is just the set of all relations isomorphic to the given relation: 
str R = \S |5~/?| = all~/?. Thus str R is an equivalence class under Alternately, 
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slrR = iS\S^I?l 

= |5| ^€bun[5=/5/?]J 
= img[S/?]bun 

That is. strR is the class of all biunivalent images of R. Note that Rc^S strR=strS . 

The structure of a set is defined in exactly the same way. Since Cirng^F = imgF5, 
we have 

str 5 = = alias' = img[Cimg5]bun 

Observe that if, following Russell and Whitehead [WhiteheadTO], we define a number as 
the class of all classes isomorphic to a given class, then the size of a class is just the 
set of all classes isomorphic to that class: size S’ = (TlTiiiSI. But this is just the 
definition of the structure of a class. Hence for all sets S, size S = str S. In other 
words, the structure of a set is its cardinality. When the identity of its elements is 
ignored, the only structural characteristic still possessed by a set is its size: 

strjl,8,25 = str{cat,dog,cowj = | = 3 



16. Data Structures 
16.1 definition 

Simple relations are not adequate for modeling all structures. For example, sup- 
pose we write this sequence: (1,2,3, 2,4,5). This is defined to be the relation 

I 1:2, 2:3, 3:2, 2:4, 4:5 | 

To make its structure more apparent, we will drav/ this as an arrow diagram: 



R 




This is certainly not what we expected, and it will not give the results wc expect. For 
example, we cannot scan through this “sequence” because /?(2) is multiple valued. 
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To avoid this problem it is often better to use data structures (or interpreted struc- 
tures). A data structure 5 is a pair (DM), where is a relation (a simple structure) 
that defines the form part of the data structure, and Z? is a function that associates 
data values with the members of R\ it is called the data part of the data structure. 
Usually domi? = memZ?, but this does not have to be the case; we will see examples 
later. 

The structure that we intended by writing (1,S,3,2,4,5) can be correctly represented 
by a data structure (DM) in which R ~ (a,b,c,d,e,f) and 

D = {a;l, b:2. c:3, d:2. e:4, f:5}. 

It doesn’t matter what a, b, c, d, e. f are, so long as they are distinct. We will VvTite data 
structure sequences with angle brackets: <1,2,3,2,4.5>. 

16.2 operations on data structurijs 

We need functions for both interrogating and updating data structures. The data 
and form parts of data structures can be extracted by a and cj, respectively. In partic- 
ular, if n is a node in S. n^mem(oS), then aSn is the value associated mth that node. 
A common situation is to inquire the value of a node selected by applying a function / 
to the form of a data structure: we write this vfS. For example. vaS is the value of 
the first element of S and ^){oS)S is the value of the second element of S . In general, 

vf(DM) = D(fR) = D@ fR ^ @{DJR) = [Id||/ ](/;,/?) 

Therefore, vj - [ld||/ ] and v - (@. ).(ld||). 

Next we define operators (p and 6 that alter their argument function so that it 
operates on either the form or the data part of a data structure, but leaves the other 
part unchanged. That is, cpf (DM) = (DJ R) and 6f (DM) - (f D M). Therefore 
(pf = ld||/ and 5/ = / ||ld, so (p - Id|| and 6 = ||ld. 

We will define an operation U such that Ilf S is the image of the structure 5' under 
the function / , that is. UfS is a structure with the same form as S but with values 
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derived by applying / to the data of S . Thus UfR is the analog for data structures of 
/ SR for relations and f .R for records and arrays. Suppose S = (D.R) and 
UfR = (D',R). For any nememi? we must have D'tl = f(Dn), so D' = f.D = [f ]R^ 
Hence, we get UfS from S by applying [/. ] to the data part of S, so TlfS = 6[f. JS", 
and n/ = 5[/. ]. For example, if S is any data structure whose values are numbers, 
then n[l+]5' adds one to each element of the data structure. 

The n operator leaves the form of the data structure unchanged; next vre consider 
operators that reform data structures. First we define operators that filter a data 
structure by removing some of its nodes. In the simplest case we just throw away the 
nodes we don’t want, only retaining those that satisfy a given property P. Hence, 

{D,R') = (D.RrP) = {D,['fP]R) = <p[fP]{D.R) 

Hence filters a data structure by eliminating all those nodes that do not satisfy 

P. Suppose that we want to eliminate the negative nodes of a data structure. Thus v.'^e 
wanti£P aSn^O ti (a^) |> 0, so P = all(a5’ |>)0. 

This simple form of filtering will often lead to nodes becoming isolated. That is, if we 
filter the sequence 

< 3, 4, -2, 6, 7, -1, 2, -4 > 
by the set P = all(a5 |Sr)0 then we will get 

3 *4 6 V 2 

Note that the node whose value is 2 (a positive number!) is not even in the relation 
anymore since it has no neighbors (it is still in the data mapping, however). Usually vre 
would prefer to connect up the remedning elements of the sequence, yielding 
<3,4,6,7,2>. How can this be accomplished? 

We will define an operator $ such that $P5 is the data structure resulting from 
filtering the data structure S by the predicate P. Suppose S = {D.R) and 
$P5 = (D,R'). R' will be derived from R by adding some new pairs to PtP. In 
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particular, we want to add just enough pairs to directly connect those nodes that were 
indirectly connected in li but are not indirectly connected in R7P. Wc will call this 
operation so R' = (/?. 

Observe that xR^y if and only if y is reachable from x in one or more steps. Simi- 
larly x(R \R^)y if and only if y is reachable from x in two or more steps. Therefore, 
first teike our original relation R and compute R*: 




Then eliminate the undesirable members by restriction, 5 = A’TP': 




There are clearly many redundant edges here. We want to eliminate any edges that 
can be generated from the others; that is, we want a rninirrial set of edges. Since 5 
cire all the edges of length two or greater, these are the redundant edges; 




If we delete these edges from 5 we will have only the nonredundant edges left, so 
5\(5|5^) is 




3 4 6 7 2 



We can now define (. First we define a useful operation fx that minimizes a relation 
by eliminating all of its redundant edges: fxR = R \ R\R*. To see that tins works just 



- 50 - 



RELATIONAL PROGRAMMING 



expand the transitive closure: 

IxR = R\R\R^ 

= R \R\{R^uR^uR^u ■ ■ ■) 

- R\ {R^^R^uR"^ • • • ) 

= R\R^\R^\R^^\ • • 

Hence, to filter a relation f? by a predicate P we use (jl[R*^\P]. Therefore, 

^PR = R' = 

= /i.[TP][tracP] 

= /i.[l'P].trac R 

where we have used trac R = Hence we have that 

(P = ;U.[l'P].trac 

Notice that this definition is really quite readable. It says, "To filter a relation, com- 
pute the transitive closure (trac), eliminate undesirable nodes [tP], and eliminate 
redundant edges (yu.)." 

We now want to extend ^ into the operation $ on data structures. Recall that ^PS 
means that a node is to be included in the result only if its value satisfies P. Hence, if 
S = (D.R) then we want to filter P by P where neP if and only if Dn€.P. Now, the set 
of all nodes whose value is in P is just the inverse image of P under D, F - img.invPP. 
Therefore, we want to filter P by img.invPP, which we do by ^(img.invPP)P. Hence, 
$P(P,P) = (P. ^[img.invPPjP). We can factor (P,P) out of this equation: 

$P5 = $P(P.P) 

= (P. ^[img.invPPjP) 

= (P, |[(@P).img.invP]P) 

= (P, (.(@P).img.inv P P) 

= (P, uncurry[^.(@P).img.inv](P,P)) 

= {aS , uncurry[(.(@P).img.inv]S) 
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= (aT uncurry[^.(@P).img.inv])5 
Therefore, 

= aT uncurry[(.(@P).img.inv] 

17. Reducing Structures 
17. 1 basic concepts 

In this section we will discuss several methods for reducing structures, that is, for 
applying a function to each element of a structure and accumulating the results. Since 
no one method has yet been selected, this section should be taken as a report of work 
in progress. 

A general paradigm for processing a structure, such as a file, is the following: 

1. Perform some initialization. 

2. Read the next (or first) element of the file. 

3. Take this value and the results of processing the previous values. 

4. Process these to yield new cumulative values and continue from step (2). 

5. When the end of the file is reached, return the accumulated result of processing all 
of its elements. 

A simple form of this appears in APL's reduction operation: 

+/F = VM - ■ (^n-.+ Vn) • • ■ ) 

A more general form is Backus’ insert: 

/ f <^\ ^n> = / / :<x„ 

Our first example of scanning structures will be to express this operation in the rela- 
tional calculus. 
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17.2 reduction of arrays 

We are given an n element array A and wish to compute; 

t = A(ti)+A{ti-1)+ ■ • • +A(2)+A(1) 

where we have assumed that the domain of A is [l..n]. We saw in the section on ances- 
trals (§13) that iter[~7’ ^ F] will iterate the application of F with T used as the termi- 
nation condition. Consider how the analogous loop would be written in Pascal: 

S := 0; i := 0; 
while i5>^n+l do 

begin S := S+A[i]; i ;= i+1 end 

On each iteration two functions are performed: S is increased by A[i] and i is incre- 
mented by 1. Let’s represent the state of the computation by a pair (s,i), where s Is 
the cumulative sum so far and i is the index of the next element to process. We will 
use F to represent one processing step, so that, if (s',i') is the new state, we can solve 
for F as follows: 

F{s,i) = (s’,i') 

= (s +Ai, i + 1) 

= ( + [s,Ai], [l+]i) 

= (+.[ld||A](s,i), [l + ].w(s,i)) 

= (+.[ld||/4]T l + .w)(s,i) 

Hence, F = ( + .[ld|lA]T l+.w). 

It remains to determine the termination condition, T. If x is a state, i.e., a pair 
(s,i), then xeT when i=n + l. Hence, xeT when cjx = n + 1, so T is the set of all state.s 
mapped by oj into n + 1. Hence, T = all wn + 1. The final state, xj , containing the sum 
is iter[~T -> F]Xj, where Xi=(0, 1) is the initial state; 

Xf = iter[~7’ ^ F](0,1) 

Now, the total t is just axj , so 
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t = a.iter[~r -» A’](0,l) 

We can generalize this to any function / with initial value i: 

t = a.iter[~7’ ^ A’](i, 1) 
where F = (/. [ld||/4] 7 l + .cj) 
and T = all u (l+sizeA) 

This result can be improved by directly extracting the result from the final state. 
That is. we want to define a filter tp such that t = Hence we want xjtpt, so 

Xf<pt {t,7i + l)(pt. Now. note that [.n + l]f = {t,n + l), so f [.n + l](f .n + l) by the 
definition of application. Therefore ip = [.n4-l]~* and we have the simplified formula 
t = [.■n. + l]“'.A’*(i.l). This leads us to the following definition of the array reduction 
operation; 

(fti)A = [.■n. + l]~L/’*(i.m) 
where F = (/ ,[Id||/4j71+ w) 
and m = min(dom A) 
and n = max(dom A) 

Therefore, if A is an array indexed m to n, then (+tO)A is the summation of A, 

EAz 

i =m 

Using this operation, the inner product of arrays A and B can be written simply as 
+$0(A5?fi’). 

EXERCISES: Show that +$0(A5?F) is the inner product of A and B . 

17.3 reduction of sequences 

Next we will consider the scanning of sequences. Suppose 5 is a sequence: 

■5 = (s„S2 s„,EOF) 

where EOF is an "end marker"; it can be any value. Now, we wish to find the result 
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i / Si / sg/ • • / s„ 

that is 

/(/(••• /( -i. Si ) • • • ), s„) 

for some function / and starting value i. The state can be represented by a pair (^,s), 
where t is the result so feir computed and s is the rest of the sequence to be pro- 
cessed. Hence, {t'.s') = F{t ,s) where t' = f {t ,as) ands' = Os. Therefore, 

F{t ,s) = = (/[i,as], Os) 

= (/. [Id||a](^ ,s), Q.w(f ,s)) = (/. [Id||a]T Q.u){t ,s). 

Hence, F = /. [Id||a] 7 O.cj. 

What is a terminal state? Notice that n(s„,EOF) = 0, so a terminal state will have the 
form (r,0). Thus the set of terminal states is the set of all those states mapped into 0 
by o: allw0. Hence, 

r = while[~ allw0, F]{i,S) 

To put this in a more useful form, we will define a function /§i such that r = (/§i)5. 
This is simply 

/§i = while[~ allw0, /. (Id||a)T n.w] . (i,) 

Then, the sum of the elements of a sequence S is just ( + §0)S. 

18. EIxamples 

In this section we will give several examples of relational programs. 

18.1 payroll 

Suppose we have a file f of employee records, where r = is the record for the 
employee with the employee number n. We will suppose that employee records are 
functions defined so that; 
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rN = employee name 

rH = hours worked so far this week 

rR = pay rate 

We are given an update file U such that Un is the number of hours worked by employee 
n today. We wish to generate a new payroll file 

SOLUTION: Let r = $n and r' = $'n be the old and new employee records. It is clear 
that r' is the same as r except for its H field. In order to modify part of a relation, \re 
will use the conditional union (or overlaying operation) defined by: 

R\S = /? u ~.dom/? S 

Then, if A' represents the new value of the H field, the new employee record is 
r' = r, where A' is just the cumulative hours worked, h' = $nH + Un. Therefore, 

by the definition of 

= r’ = 

To find we must factor out the employee number n. To do this, note that $nH = 
[®H]($n) = That is, is a slice of the payroll file: the hours 

worked for each employee. Therefore, 

A' = $71 H + Un = [@H].$7i + Un 
= ($@H T U)n 

Now. define the updating function u by 

u{n) = [H, ($@H T U)n] 

= T U)n 

Then, $'n = 7 x(ti); $ti = [^~\u]n. Therfore, the solution to our problem, the new payroll 
file, is 

$' = 11 ]^, where u - [H,].($§H T U) 
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18.2 check issuing 

Suppose we wish to take the payroll file from the previous example and generate 
checks for the employees. We will assume that a function C is available such that 
C{m,p) returns a check in the amount p made out to the name m. 

SOLUTION: We will ignore overtime computations. Hence, if n is an employee 
number then 4>nN is his name and 

p(n) = $nH X $nR 

is his pay. Therefore p = ^>@H x $SR. Now observe that his check c{ti) is 
c(n) = = C($nN,pn) = C($@Nn,pn) = C. ($@N7p)n.. Combining these we 

have the file F mapping employee numbers into checks: 

F - C. [$@NT ($@H X ^>@R)] 
from which we can factor out the old payroll file: 

F = a [@N7(@HX@R)].$ 

If we just want a set of checks, this is dom.invF. 

18.3 pseudo-natural notation 

Relational programs can be made less intimidating by using the pseudo-natural 
notation described in [MacLennanBS]. This notation uses words in place of symbols and 
uses a comma convention to suppress many parentheses. The frequency table pro- 
gram from §1, F - size, (all T), can be written: 

Treq-table' means all text then size. 

Here, 'Freq-table' = F and 'text' = T . 

The payroll example looks like this in the pseudo-natural notation: 

‘Updates' means: 

Old-Master slice Hours, each add Hours-Worked, 

then pair-with Hours. 
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'New-Master' means Updates each replace Old-Master. 

Here, 'Updates' = u, 'Old-Master' = 'slice' = 'Hours’ = H, 'each add’ = T, 'Hours- 
Worked' = U, ‘then’ = |, 'pair-with = rr, ‘New-Master’ = and 'each replace’ =T. 

The check issuing example is also easily put into this notation: 

'Checks’ means 
Old-Master then: 
something slice Name 

also something slice Hours each times something slice Rate, 
then Write-check. 

Here, 'Checks' = F, 'something' represents an omitted argument, ‘Name’ = N, ’also’ = 
7, ’each times’ = 5?, 'Rate' = R, and ‘Write-check’ = C. 

19. Implementation 
19. 1 introduction 

The primary goal of our investigation has been to determine if relational program- 
ming is significantly better than conventional methods. It would be premature to 
devote much effort to implementation studies before it is even determined if relational 
programming is an effective programming methodology. However, a brief discussion of 
implementation possibilities is probably not out of line. 

The most obvious representation of a relation is the extensional representation, in 
which all the elements of a relation or class are explicitly represented in memory. 
There are many kinds of extensional representations, such as hash tables, binary trees 
and simple sorted tables. Of course, performance can be improved through the use of 
associative memories and active memories (in which each memory cell has a limited 
processing capability). 

Some relations and classes will be so large that it is uneconomical to represent 
them explicitly in memory. In these cases an intensional representation 
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[MacLennan?3] should be used. Here a class or relation is represented by a formula or 
expression for computing that relation or class. Operations on the class or relation are 
implemented as formal operations on the expression. This is feasible because of the 
simple algebraic properties satisfied by relations. It can be seen that an intensional 
representation is really just a variant of a lazy evaluation mechanism [Henderson?6, 
HendersonBO]. Sometimes an intensional representation is necessary; for instance, 
relations of infinite cardinality, such as the numerical operators and relations, require 
an intensional representation. 

19.2 computability 

It can be shown on theoretical grounds that some of the operators we have 
described are not implementable in their full generality. For example, if unimg v/ere 
applicable to all computable functions, it would be possible to solve the halting prob- 
lem, since 

Halts(/ ,x) unimg/x 9 ^ 0 

Since the halting problem is not solvable, we cannot implement unimg and the other 
operators used in the definition of Halts so that they works on all computable func- 
tions. Similar arguments set bounds on the implementability of many of the other 
operators. 

These limitations do not prevent the use of the relational operators as a 
specification language. For this purpose it Is only necessary that relational programs 
precisely specify the relationships between inputs and outputs, not that the programs 
be implementable. However, if we vash to use the relational operators for executable 
specifications or for a full-fledged programming language, then the issue of implemen- 
tability becomes important. 

19.3 extensional representation 

It should be clear that all the operators are implementable on extensiouxally 
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represented sets and relations, that is, on sets and relations whose elements are expli- 
citly listed in some form in the computer's memory. Obviously, only finite sets and 
relations can be represented extensionally. Suha Futaci [FutaciBS] has analyzed the 
complexity of the algorithms associated with several different extensional representa- 
tions. 

19.4 mtensional representations 

Infinite sets and relations must be represented intensionalLy, that is, without expli- 
citly listing their elements. There are several ways of accomplishing this. For exam- 
ple, infinite sets can be represented by their characteristic functions : total, comput- 
able, Boolean-valued functions that determine whether or not a given element is in the 
set. Since we require these functions to be computable they can be expressed in a 
finite algorithm and so are finitely reresentable in the computer's memory^. 

Another intensional representation of infinite sets makes use of computable 
enumeration functions. If / is an enumeration function for a set then /(l), / (2), ... 
are distinct elements of the set. If n is greater than the cardinality of the set, then 
/ (n) might not halt. 

One of the most common intensional representations of infinite relations makes use 
of the corresponding computable function. That is, the computable function / can be 
used to represent the relation R when y-f {x) {z:y)^R. Clearly, this representa- 

tion can be used only when R is right univalent. Also, if x £ domf then the computa- 
tion of / (x) might not halt. 

19.5 eliminating polymorphism 

When we investigate each of the various extensional and intensional representations 
of sets and relations, we find that different combinations of the operators are imple- 
mentable on each representation. This could lead to a very confusing situation for the 

3. Of course, ccmputabCe characteristic functions only allow (by definition) the representation of re cursive 

sets. There is little to be lost in restricting our attention to recursive sets, however. 
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TABLE 1. Sets Represented by their Characteristic Functions 



.Set Operation 


Operation on Characteristic Function 


xeS 


S{x) 


X ^ S 


-S{x) 




-.S 


SnT 


S7\T 


SuT 


SSZT 


S\T 


S7\-.T 


SdT 


-.S\TT 


SxT 


MS\\T) 


xRy 


R{^ .y) 


unimg/?x 


R. (,x) 


alliTx 


R. (x.) 


inv/? 


R.inv 


S^R 


S.a7\R 


R^S 


R 7\S. a 


/?t5 


A A.(5||5) 


R\\S 


R. aS A S. coS 


R~.S 


R. Hdlla) A 5. (Idlb) 



relational programmer. Without consulting a table of some kind the programmer 
would never be sure whether or not a particular combination was implementable. 
Therefore, relational programming will be simplified if we can divide the operators into 
disjoint classes in such a way that each operator is applicable and implementable on 
exactly one representation. Fortunately, when we investigate the use of the relational 
operators we find that certain operators are mostly used on finite sets and relations 
and others are mostly used on computable functions. Thus we have a basis for a divi- 
sion of the operators. 

To accomplish this goal it is necessary to eliminate any polymorphism, that is, any 
operators that are both implementable and useful on more than one representation. 
For example, the set operations (n, u, \, etc.) are useful and implementable on both 
finite sets and infinite sets represented by characteristic functions. However, the set 
operations on infinite sets are easily expressed as abstractions and compositions of the 
Booleam operations applied to the corresponding characteristic functions; see Table 1. 
The simplicity and directness of this representation of infinite sets and their operators 
permits us to eliminate them as basic objects in relational programming. Thus, the set 
operations (n, u, \, etc.) will only be allowed on finite sets and relations. 
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Since we have eliminated characteristic functions as built-in representations of 
infinite sets and relations, we are left with only two others: enumeration functions and 
computable functions (for right univalent relations). We have chosen to eliminate 
enumeration functions because they have few uses and these can be easily expressed 
using the functional operations. 

This leaves us with two classes of objects in relational programming: 

• Finite sets (and hence relations) 

• Computable functions 

There are only a few operations that are both useful and implementable on both of 
these classes. For example, the application operation can be used both for applying a 
computable function to its argument and for looking up an item in a table (a finite rela- 
tion). Therefore we define two versions of this operation: f@x, which applies the com- 
putable function f to x , and t^x (suggesting subscripting), which applies the finite 
relation (table) t lo x . We allow f@x to be abbreviated fx and tlx to be abbreviated 

For some of the polymorphic operations either the intensional version or the exten- 
sional version can be easily expressed in terms of other operations. In these cases the 
easily expressible version can be dropped with little loss of convenience. An example of 
this is img.inv/j), where / is a total function and p is a characteristic function. This 
can be written p./ since p./ is the characteristic function of img.inv/p. 

19.6 extensioDal operators 

The results of the separation process are displayed in Tables 2-5. Table 2 lists the 
primitive operations on finite, extensionally represented sets and relations. These 
operations are considered primitive because they are not simply defined in terms of 
other operations. Tables 3 and 4 show the non-primitive operations on extensionally 
represented sets and relations, that is. those that can be simply defined in terms of 



- 62 - 



RELATIONAL PROGRAMMING 



TABLE 3. Primitive Extensional Operations 



Onerator 


Meaning 


tlx 


application 


t |tx 


relative product 


t~u 


construction 


x\y 


pair formation 


s Ut 


union 


unx 


unit-set formation 


cur t 


Currying 


unc t 


un- Currying 


l^X 


unique element selection 


sizex 


cardinality 


str^ 


structure of relation 


r 


transitive closure 



TABLE 3. Non-primitive Extensional Operations (Part 1) 



Qpe.ratpr._ 


Definition 


{x.y) 


un(x:7/) 


(^.} 


un. (x:) 


(.?/) 


un.(:7/) 


At 


(x.x) 


X 


or.img[a:=]^ 


SQt 


and.(img[e^ ])s 


S=t 


SQt A.tQS 


inv t 


img[:.(Tl.Hd)!]^ 


dom t 


img Hd t 


rng t 


dom. inv t 


mem t 


dom^ u rng^ 


Lm(x,f ) 


xedom^ 


Rm(a:,f) 


xerng^ 


Mm(x,f) 


xemem^ 


run t 


and . img(l = .size.[unimg^ ]) . dom^ 


lun t 


run. inv t 


bun t 


run^ A lun^ 


init t 


dom^ \ rng^ 


term t 


rng^ \ dom^ 


f* 


u (img :.A).mem t 


p 


mer{p.m)t 


t <-p 


mter{p.Tl)t 


tvp 


TD-*t <^v 



other operations. Although these operations are non-primitive, we would expect that 
they would be built-in in a relational programming system. These definitions make use 
of several new primitive operations, v/hich are defined in Table 5. They also make use 
of the operations on elementary pairs: lid = a.un and T1 = cj.un. 
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TABLE 4. Non-primitive Extensional Operations (Peirt 2) 



Operator 


Definition 




a 

Ca) 

At 

n t 

t \u 

X c\t 

t cr X 
min s 
max s 
S C\t 
s\t 
^0..7n j 
\Tn..7i\ 

t%x 
t \x 
lit 

indexcpt 
select9P 
join^ 
as t 
sa t 

saO t 
rp ft 
rpi ft 
t cat u 
rsort s 
sort s 
unimgix 
uni mg ^ 
ssm 


-d.init 
-d.term 
t ^ fL termf 
initf -> t 

t u It domf -> u 
{x ,at)ut 
t u{ot ,x) 
a.(img<) s xs 
«.(img<) s xs 
dom[s -» fxunO] 
dom[ It t -* s xunO] 

/”‘(0,un0) where / (n,s)=(n + l,suunn) 
img[7n +]|0..n —m J 
img[:.A]^m..n j 
img(HdT @x. 11)^ 

@xM 
t \t\t^ 
img[j.95TId]i 
img,(indeX95) 

imgu . dom . inv . 1| . [index^||index^] 
img[A4.T Ai.l+](domi \un.max.domO 
/§(1.0) 

where /[x,(i,a)] = (i + 1, au[i,x]) 
/§(O,0) (/ defined above) 
img[HdT /.Tl]f 
img[/.HdTTl]^ 
t u rpi[+size.dom f ]u 
img ^ s xs 
size. inv.all, rsort s 
rng[unx -* < ] 

tmg/(dom0 where fx = x:(unimg<x) 
unc.sa.sa,!? 



19.7 intensional operators 



All the intensional operators can be expressed using recursive definitions and 
lambda expressions. Nevertheless, it is useful to divide these operators into two 
classes, primitive and non-primitive, on the basis of whether they can be easily defined 
in terms of the other operators. The intensional operators eire shown in Tables 6 and 7. 



20. Conclusions 

Of course, we are not the first to propose introducing aspects of a relational calculus 
into programming. Codd [Codd70] has used a relational calculus as the basis for data 
base systems. Although he defines several operations on relations (viz., permutation. 



- 64 - 



RELATIONAL PROGRAMMING 



TABLE 5. New Primiti ve Extensional Operations 

and ^t rue J = true 
and^falsej = false 
and^true,false| = false 

or^truel = true 
orjfalsej = false 
or ^ true, false I = true 

union^^i.^g 5nj = SfUSz'J ■ ■ ■ uS„ 

filtenp5' = ix|x£5' Apfx'H (a finite setl 



TABLE 6. Primitive Intensional Operations 



Operator 


-Definition 


f@x 

img/s 

if -9)^ 

X7T 

nx 

if\\g)x 
f St 
if ®9 
ip^f-.g) 

curry/ 
uncurry/ 
<i>jD (d,r) 
iter[p 1 


\fx |xesj 

figx) 

7T.(x.) 

7T.(,X) 

ifx.gx) 

img[/||/]f 

ifx)igx) 

if px then f x else gx 

[/• ]£ 

f. a® u 

(d, fx[r T p.dJ-]) 

iv iterrp-^f 1; ld).f 



TABLE 7. Non-primitive Intensional Operations 



Operator 


Definition 


while [p ,/ ] 


p -> iter[p -»/ ]; Id 


/§i 


while[0?£.ca, (/.[Id||o] || f).c,;).A].(i.) 




while[n?£.a, l + ||/].(0,) 


~^f 


@.[ld||/] 




Id|| 


(5 


||Id 


n/ 


5[/.] 


extend(f ,/ ) 


edomf -> ti. f 


restrict(s,/) 


img[T3 . Id||/ . A]s 




-V. 



join, tie, composition, and restriction), this small set of operations is insufficient for 
general purpose programming. These remarks also apply to Childs’ reconstituted 
definition of relations [Childs69], which is also oriented towards data bases. Feldman 
and Rovner [Feldman69] augmented Algol with several relational operators for associa- 
tive access to a data base. Their operations, which are our plureJ description and 
image, are quite limited, being based on a traditional von Neumann language. 
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One general purpose language that does make extensive use of sets and relations is 
SETL [Kennedy?5]. It provides most of the familiar operations on sets (e.g., union, 
intersection, difference, powerset, image). SETL differs from relational programming 
in three significant respects: (l) it can only handle sets, (2) many operations 
must still be performed in a word-at-a-time fashion using the set former, and (3) it 
resorts to conventional control structures. 

Finally, we must mention logic programming systems, such as PROLOG [Kov/alski79, 
vanEmden?6], which use predicate logic to describe computational processes. These 
systems also differ from relational programming in two significant respects: (1) they 
have a word-at-a-time programming style due to the use of variables representing indi- 
viduals in the clauses of the program, and (2) they are implemented using a resolution 
theorem prover, whereas a more conventional procedural implementation suffices for 
relational programming. Essentially the same remarks apply to Popplestone’s rela- 
tional programming [Popplestone?9], which is like logic programming except that it 
uses "forwdird inference" rather than "backward inference." 

In summary, no other programming style that we are aware of combines the univer- 
sal use of relations with a rich set of operations on those relations that can be imple- 
mented in a deterministic, procedural v/ay. It is hoped that the proceeding discussion 
has made plausible some of the advantages claimed for relational programming in the 
Introduction. Considerable work remains to be done in evaluating the effectiveness cf 
a relational calculus as a programming tool. For instance, the optimum set of combi- 
nators and relational operators must be selected. Another non-trivial problem is the 
selection of a good notation for the relational calculus. More from convenience than 
conviction we have based our notation on [WhiteheadTO] and [CarnapbO]. Making rela- 
tional programming an effective tool wll require designing a notation that combines 
readability with the manipulative advantages of a tv/o-dimensional algebraic notation. 
This is all preliminary to any serioLis considerations of software or hardware implemen- 
tation techniques. 
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